Artificial Neural Network & Mel-Frequency Cepstrum Coefficients-Based Speaker Recognition
نویسندگان
چکیده
Speaker recognition is the process of automatically recognizing who is speaking on the basis of individual information included in speech waves. This technique makes it possible to use the speaker’s voice to verify their identity and control access to services such as voice dialing, banking by telephone, telephone shopping, database access services, information services, voice mail, security control for confidential information areas, and remote access to computers. This document demonstrates how a speaker recognition system can be designed by artificial neural network using MelFrequency Cepstrum Coefficients of voice signal. Note that the training process did not consist of a single call to a training function. Instead, the network was trained several times on various input ideal and noisy signals coded by MelFrequency Cepstrum Coefficients, the signals which contents voices. In this case training a network on different sets of noisy signals forced the network to learn how to deal with noise, a common problem in the real world.
منابع مشابه
Throat Microphone for Speaker Recognition Using AANN
In this paper, we have analyzed the performance of speaker recognition system based on features extracted from the speech recorded using throat microphone in clean and noisy environment. In general, clean speech performs better for speaker recognition system. Speaker recognition in noisy environment, using transducer held at the throat results in a signal that is clean even in noisy. This speak...
متن کاملSignificance of formants from difference spectrum for speaker identification
In this paper, we describe a prototype speaker identification system using auto-associative neural network (AANN) and formant features. Our experiments demonstrate that formants extracted from difference spectrum perform significantly better than formants extracted from normal spectrum for the task of speaker identification. We also demonstrate that formants from difference spectrum provide com...
متن کاملA Novel Approach for Text-Independent Speaker Identification Using Artificial Neural Network
This article presents the implementation of Text Independent Speaker Identification system. It involves two parts“Speech Signal Processing” and “Artificial Neural Network”. The speech signal processing uses Mel Frequency Cepstral Coefficients (MFCC) acquisition algorithm that extracts features from the speech signal, which are actually the vectors of coefficients. The backpropagation algorithm ...
متن کاملMel Frequency Cepstral Coefficients for Speaker Recognition Using Gaussian Mixture Model-Artificial Neural Network Model
Speaker Recognition (SP) is a topic of great significance in areas of intelligent and security. In Biometric SP using automated method of verifying or recognizing the identity of the person on the basis of some application, such as a finger print or face pattern and human voice. Many method have been proposed in the literature are focusing on front end processing such as PLP and LPC. In this pa...
متن کاملScore Level Fusion Based Personal Authentication Using Fingerprint and Speech
In this paper development of a multimodal based biometric fusion system is discussed. A fingerprint recognition system is developed using global singularity features. Mel-frequency Cepstral Coefficients are used to recognise a speaker using the backpropagation artificial neural network. A score level fusion based recognition system is developed using fingerprint and speech match scores and the ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005